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Abstract 

The Mallows measure on the symmetric group S„ is the probability measure such that each 
permutation has probability proportional to q raised to the power of the number of inversions, 
where q is a positive parameter and the number of inversions of tt is equal to the number of 
pairs i < j such that tt^ > ttj . We prove a weak law of large numbers for the length of the 
longest increasing subsequence for Mallows distributed random permutations, in the limit that 
n — >■ oo and g — >■ 1 in such a way that n(l — q) has a limit in R. 

Keywords: random permutations 
MCS numbers: 335 

1 Main Result 

There is an extensive literature dealing with the longest increasing subsequence of a random per- 
mutation. Most of these papers deal with uniform random permutations. Our goal is to study the 
longest increasing subsequence under a different measure, the Mallows measure, which is motivated 
by statistics [T6|. We begin by defining our terms and stating the main result, and then we give 
some historical perspective. 

The Mallows(n, q) probability measure on permutations S'„ is given by 

/i„,,({7r}) = (1.1) 

where inv(7r) is the number of "inversions" of tt, 

inv(7r) = #{(i,j) e {l,...,n}2 : z < j , ir, > tt^} . (1.2) 

The normalization is Z{n, q) ^ Y.^es„ See [8] for more background and interesting features 

of the Mallows measure. The measure is related to representations of the Iwahori-Hecke algebra as 
Diaconis and Ram explain. It is also related to a natural (j-deformation of exchangeability which 
has been recently discovered and explained by Gnedin and Olshanski [51 [TU] . 
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We are interested in the length of the longest increasing subsequence in this distribution. The 
length of the longest increasing subsequence of a permutation tt g S'„ is 

€(7r) = max{fc < n : tt^^ < • • • < 7r,j^ for some ii < ■ ■ ■ < i^} . (1-3) 

Our main result is the following. 

Theorem 1.1 Suppose that {qn)'^=i is a sequence such that the limit /3 = lim„_^oo ^^(1 — Qn) exists. 
Then 

Jiim ({^e5„ : |n-i/2£(7r) I <e}) = 1, 

for all e > 0, where 

{2/3-1/2 sinh-i(Ve'3 - 1) for /3 > 0, 

2 /or/3 = 0, (1.4) 

2|/3|-i/2 sin-i(Vl - e^) for/3<0. 

Remark 1.2 There is another formula for the function in\1.4\ as an integral in equation\7.2[ below. 



In a recent paper [?j Borodin, Diaconis and Fulman asked about the Mallows measure, "Picking 
a permutation randomly from Pg (their notation for the Mallows measure) , what is the distribution 
of the cycle structure, longest increasing subsequence, . . . ?" We answer the question about the 
longest increasing subsequence at the level of the weak law of large numbers. 

Note that the Mallows measure for q = I reduces to the uniform measure on S'„: 

for all TT £ Sn. For the uniform measure, Vershik and Kcrov [22] and Logan and Shepp [15] already 
proved a weak law of large numbers for the length of the longest increasing subsequence. We will 
use their result in our proof, so we state it here: 

Proposition 1.3 

lim Mn.i{^ e Sn : \n-^/^£{n) - 2| > e} = 0, (1.5) 

n— f oo 

for all e > 0. 

The reader can find the proof of this proposition in [22] and [15]. The proof of Vershik, Kerov, 
Logan and Shepp involved a deep connection in combinatorics known as the Robinson-Schensted- 
Knuth algorithm. In the RSK algorithm, for each permutation tt Cz Sn, one associates a pair of 
Young tableaux, with the same shape: a Young diagram or partition, A h n. Referring to this as 
A = sh(7r), the first row, Ai is the length of the longest increasing subsequence of tt. Vershik and 
Kerov considered the Plancherel measure on the set of partitions, where /Xn(A) is proportional to the 
number of permutations tt such that sh(7r) = A. Using the hook length formula of Frame, Robinson 
and Thrall for this probability measure, they defined the "hook integral" for limiting (rescaled) 
shapes of Young tableaux. They showed concentration of measure of the Plancherel measure, in 
the TV — > cx) limit, around the optimal shape of the hook integral. See, for example, |13] for many 
more details. 

The RSK approach was taken further by Baik, Dcift and Johansson 0]. In their seminal work, 
they gave a complete description of the fluctuations of Ai, which are similar to fluctuations of the 
largest eigenvalue in random matrix theory |21| . 
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For the Mallows measure, we do not believe that there is a straightforward analogue of the 
RSK algorithm. The reason for this is that the shape of the Young tableaux in the RSK algorithm 
does not determine the number of inversions. For progress on this difficult problem, see a result of 
Hohlweg [E]. 

On the other hand, long after the original proof of Vershik, Kerov, Logan and Shepp, there 
was a return to the original probabilistic approach of Hammersley . Hammersley had originally 
asked about the length of the longest increasing subsequence, as an example of an open problem 
with partial results, delived in an address directed primarily to graduate students. Using meth- 
ods from interacting particle processes, Aldous and Diaconis established a hydrodynamic limit of 
Hammersley's process, thereby giving an independent proof of Theorem 11.31 This approach was 
also developed by Seppalainen [15], Cator and Groeneboom [5], and others. This allows different 
generalizations than the original RSK proof of Vershik and Kerov. For instance, Deuschel and 
Zeitouni considered the length of the longest increasing subsequence for IID random points in the 
plane [Hl^. For us, their results are key. 

On the other hand if, contrary to our expectations, there is a generalization of the RSK algorithm 
which applies to Mallows distributed random permutations, that would allow the discoverer to try 
to extend the powerful methods of Baik, Deift and Johansson. 

The rest of the paper is devoted to the proof of Theorem 11.11 We begin by stating the key 
ideas. This occupies Sections 2 through 6. Certain important technical assumptions will be stated 
as lemmas. These lemmas are independent of the main argument, although the main argument 
relies on the lemmas. The lemmas will be proved in Sections 7 and 8. 

2 A Boltzmann-Gibbs measure 

In a previous paper }19j one of us proved the following result. 

Proposition 2.1 Suppose that the sequence {qn)^=i has the limit /? = lim„_>.oo '^■(1 — Qn)- For 
n G N, let 7r(aj) S Sn be a Mallows{n,qn) random permutation. For each n G N, consider the 
empirical measure pn{-,Oj) on R^, such that 



for each Borel set A C R^. Note that pn{-,uj) is a random measure. Define the non-random measure 
pp on R^ by the formula 



Then the sequence of random measures pn{-,uj) converges in distribution to the non-random measure 
pp, as n oo, where the convergence is in distribution, relative to the weak topology on Borel 
probability measures. 

We will reformulate Lemma |2.1[ using a Boltzmann-Gibbs measure for a classical spin system. 
The underlying spins take values in R^. We define a two body Hamiltonian interaction h : R^ — )■ R 

as 




dp nix, y) 



(/3/2)sinh(/3/2)l[o,i]2(x,y) 



■J dxdy . 



(2.1) 



cosh{l3[x - y]/2) - e-'3/4 cosh(^[a: + y- l]/2)) 



h{x,y) = l{xy < 0} . 
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Then the n particle Hamiltonian function is Hn '• (R^)^ — > R, 

^ n — 1 n 

Hn{{xi,yi), . . . ,{Xn,yn)) = ^^^"^ X] X! H^i ^ ^3 ' Vi ~ Vj) ■ 

One also needs an a priori measure a which is a Borel probability measure on R^. Given all this, 
the Boltzmann-Gibbs measure on (R^)" with "inverse-temperature" /? e R is defined as Hn,a,/3, 

, . . exp{- l3Hn{{xi,yi),...,{xn,yn))) l\"=ida{xi,yt) 
aMn.Q,/3((a;i,yij, . . . , (x„,y„)) = — — 

where the normalization, known as the "partition function" is 

P n 

Z„(a,/3) = / exp(-/3iJ„((xi,yi),...,(a;„,2/„))) ]Tda(xj,y,). 

Usually in statistical physics one only considers positive temperatures, corresponding to /3 > 0. But 
we will also consider /? < 0, because it makes mathematical sense and is an interesting parameter 
range to study. 

A special situation arises when the a priori measure a is a product measure of two one- 
dimensional measures without atoms. If A and k are Borel probability measures on R without 
atoms, then 

Ain,AxK,^i|((a;i,yi), • ■ • , ixn,yn)) e(R^)" : 3ii < ■ ■ ■ < ik , 

such that {x^. - x^^){yi. - J > for all j ^ ^| (2-2) 

= Mn.oxp(-^J/(n-l))({7'' G Sn : £(7r) > k}) , 

for each k. This follows from the definitions. In particular, the condition for an increasing sub- 
sequence of a permutation ii < • • • < is that if ij < ii then we must have tt^^ < tt^^. For the 
variables (xi, yi), . . . , (a;„, y„) replacing the permutation, we obtain the condition listed above. 

We will also use results from [6] by Deuschel and Zeitouni. They define the record length of n 
points in R^ as 

£((xi,yi), . . . , (x„,y„)) = max{fc : 3ii < ■ ■ ■ < ik , {xi^ -Xii){yi. -yi^) > for aU j < i} . (2.3) 

Equation (|2.2p says that the distribution of ^((Xi(cj), Yi(cj), . . . , X„(cj), y,i(a;))) with respect to 
the Boltzmann-Gibbs measure fJ-n,\XK,^ is equal to the distribution of £{t:{lo)) with respect to the 
Mallows(n, exp(-/?/(n - 1))) measure /iri,cxp(-/3/(ri-i))- 

Using the equivalence and Lemma 12.11 we may also deduce a weak convergence result for the 
measures Hn,\XK,p- In fact there is a special choice of measure for A and k, depending on /3, which 
makes the limit nice. 

For each /3 e R \ {0} define 

L{p) = [(l-e-^)//3]i/2, 
and define L(0) = 1. Define the Borel probability A^ on R by the formula 
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for /3 7^ 0, and dXo{x) — l[o,i](a;) dx. Also define a measure tr^ on by the formula 



Both the X and y marginals of ap are equal to the one-dimensional measure A^. Using this, the 
next lemma follows from Lemma |2. II and the strong law of large numbers. In fact, the strong law 
implies that an empirical measure arising from i.i.d. samples always converges in distribution to 
the underlying measure, relative to the weak topology on measures. 

Lemma 2.2 For n e N, let ((X„.i(a;), y„ ^(a;)), . . . , (X„.„(aj), y,i.„(aj))) be distributed according 
to the Boltzmann-Gibbs measure /^n.A^xA^.^, where we used the special a priori measure just con- 
structed. Define the random empirical measure (T„(-,a;) on T{? , such that 

1 " 

a„(A,w) := -^i{(x„,,H,r„,,(w)) e A}, 

i=i 

for each Borel measurable set A C R^. Then the sequence of random measures (o'„(-,w))^2 
converges in distribution to the non-random measure ap, in the limit n — >■ cx), where the convergence 
in distribution is relative to the topology of weak convergence of Borel probability measures. 

We could have also chosen a different a priori measure to obtain convergence to the same 
measure pp from Lemma 12.11 But we find the new measure ap to be a nicer parametrization. 
We may re-parametrize the measures like this by changing the a priori measure. The ability to 
re-parametrize the measures will also be useful later. 



3 Deuschel and Zeitouni's record lengths 

In [6], Deuschel and Zeitouni proved the following result. We thank Janko Gravner for bringing 
this result to our attention. 

Theorem 3.1 (Deuschel and Zeitouni, 1995) Suppose that u is a density on the box [01,02] x 
[&i,62]7 i-c, da{x,y) = u{x,y)dxdy is a probability measure on the box [01,02] x [&i,&2]- Also 
suppose that u is differ entiable in (01,02) x (61,62) o,nd the derivative is continuous up to the 
boundary. Finally, suppose there exists a constant c > such that 

u{x,y) > c, 

for all (x,y) G [01,02] x [61, &2]. Let (J7i, X^i), ([/2, V2), . • . be i.i.d., a-distributed random vectors in 
[ai , 02] X [5i , 62] • Then the rescaled random record lengths, 

n-^/HiiUi,Vi),...,{Un,Vn)), (3.1) 

converge in distribution to a non-random number J'* {u) defined as follows. Lei ([01,02] x [61,62]) 
be the set of all curves from (oi,6i) to (02,62) whose tangent line has positive (and finite) slope 
at all points. For j G Cy,([oi,02] x [61,62]) and any parametrization {x(t),y{t)), define 

J{u,l) = 2 ( ^u{x{t),y{t))x'{t)y'{t) dt . (3.2) 
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This is parametrization independent. Then 

J*[u) = sup J{u,-i). 

7eC^([ai,a2]x[hi,b2]) 

This is Theorem 2 in Dcuschel and Zcitouni's paper. The fact that 7) is parametrization 
independent is usefuL 

We generahze their definition of J'(u,^) a bit, attempting to mimic the definition of entropy 
made by Robinson and Ruelle in [T7] . This is useful for estabfishing continuity properties of J and 
it allows us to drop the assumption that u is differentiable. 

Given a box [ai,a2] x [61,62], we define n„([ai,a2] x [61,62]) to be the set of aU {n + l)-tuples 
r = ((a:o, 2/o), . . . , (xn, Vn)) e (R2)"+i satisfying 

oi = xo < • • • < a;„ = 02 and 61 = z/o < • • • < 2/n = 62 . 

We define 

Xk+l CVk+l \ 1/2 

u{x, y) dxdy \ . (3-3) 



k=o ''y 



For later reference, we note the following continuity property of J'{u^V) as a function of u for a 
fixed v. Suppose that u and v arc nonnegative functions in C([ai,a2] x [61,62]). Using the simple 
fact that [a — 6| < \J\a? — 6^1, for aU a, 6 > 0, we see that 



\J{u,r)-J{v,r)\ < 2^(/" \u{x,y)-v{x,y)\dxdy 



1/2 

\u{x,y) - v{x,y)\dxdy^ 

fc=0 •'Vk 

We define to be the supremum norm. Using this and the Cauchy inequality. 



\J{u,V) - J{v,V)\ <2\\u- ^ ^{xk+i-Xk){yk+i'Vk) 



Tl-l 

< ||w - ^(xfc+i - Xk + yk+i - Vk) 

k=Q 

1/2 



(3.4) 



= ||?i-u||^/^(a2-ai+62-6i). 
Now we state a technical lemma. 

Lemma 3.2 Lei S/<([ai, 02] x [61, 62]) be the set of all connected sets T d [oi, 02] x [61, 62] containing 
(oi, 61) and (a2, 62), and having the property that (xi — X2){yi — 2/2) > for all (xi, yi), {x2, 2/2) € T. 
Define n„(T) to be the set of all V = {{xo,yo), ■■ ■ , {xn,yn)) in n„ such that {xk,yk) G T for each 
k, and let n(T) = IJ^i n„(T). Finally, define 

J{u,T) - lim inf |:f(ii,7') ■ V e \J n„(T) , ||P|1 < e| . 

Then J^{u, •) is an upper semi- continuous function of B /i([ai, 02] x [61, 62]), endowed with the Haus- 
dorff metric. 
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If T is the range of a curve 7 G C^([ai,a2] x [61,62]), then J{u,T) = J{u,^) because for each 

partition V G n(T), the quantity J^{u,V) just gives a Riemann sum approximation to the integral 
in J{u,-f). 

Now, let us denote the density of as 

Then we may prove the following variational calculation. 
Lemma 3.3 For any T e 6/.([0, 

.L(/3) 2 

Jiup,T) < Y^dt = £(/3). 

Let us quickly verify the lemma in the special case (3 = 0. We have set L{0) = 1 and we know that 
Uq is identically 1 on the rectangle [0, 1]^. By equation p.4p . we know that 

J{uo,V) < 2, 

by comparing u = uq with v = 0. That means that J7(mo,T) < 2 for every choice of T. It is 
easy to see that taking T = {{t,t) : < t < 1}, which is the graph of the straight line curve 7 
parametrized by x{t) = y{t) — t ior < t < 1, 

J{uo,T) = J{uo,j) = 2 / VMx{t),y{t))x'{t)y'{t)dt = 2. 



^0 

Therefore, using Deuschel and Zeitouni's theorem, this shows that the straight line is the optimal 
path for the case of a constant density on a square. 

This lemma in general is proved using basic inequalities, as above, combined with the fact 
that i7(m,7) is parametrization independent, which allows us to reparametrize time for any curve 
(a;(t), y{t)). As with the other lemmas, we prove this in Section 7 at the end of the paper. 

4 Coupling to IID point processes 

Now, suppose that /3 is fixed, and consider a triangular array of random vectors in R^, 

{{Xn.k,Yn^k) ■■ neN, l<k<n),, 

where for each n G N, the random variables {XnA,Yn,i), • • • , (^n,n, Yn^n) are distributed according 
to the Boltzmann-Gibbs measure fJ'n,\f,x\f,,f3- We know that 

Mnxxp(-^i/(n-l)){^(7r) = fc} = P{£((X„,i,y„,i),...,(X„,„,r„^„)) = k}, 

for each k. We also know that the empirical measure associated to ((X„^i, F„_i), . . . , (X„_„, Yn^n)) 
converges to the special measure ct/j. It is natural to try to apply Deuschel and Zeitouni's Theorem 
13. 1[ even though the points (X„ 1, Yn i), . . . , (X„ „, Y^^n) are not i.i.d., a requirement for the random 
variables (C/i, Vi), . . . , (J7„, ¥„) of their theorem. 
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It is useful to generalize our perspective slightly. Let us suppose that A and k are general 
Borel probability measures on R without atoms, and let us consider a triangular array of random 
vectors in R^: {{Xn,k,Yn,k) ■ n S N, 1 < fc < n), where for each n g N, the random variables 
{Xn,i{uj) ,Yn,i{u!)) , . . . , (X„^„(cj), F„_„(ti;)) are distributed according to the Boltzmann-Gibbs mea- 
sure fin \XK f3- Let us define the random non-normalized, integer valued Borel measure w) on 
by ' 

71 

UAio) = ^i{(x„.,(w),y„.,H) e A}, (4.i) 

i=l 

This is a random point process. 

A general point process is a random, locally finite, nonnegative integer valued measure. We will 
restrict attention to finite point processes. Therefore, let X denote the set of all Borel measures 
^ on R^ such that ^{A) G {0, 1, . . . } for each Borel measurable set A C R^. Then, almost surely, 
is in X. In fact ^„(R^,w) is a.s. just n. For a general random point process, the total 
number of points may be random. 

Definition 4.1 Let Vn,\XK.i3 be the Borel probability measure on X describing the distribution of 
the random element ^ni' , ^) G X defined in (^T7|); where (X„^i(a;), y„,i(a;)), . . . , (X„_„(a;), y„,„(u;)) 
are distributed according to the Boltzmann-Gibbs measure fJ-n,\XK,/3- 

Given a measure € X, we extend the definition of the record length to 

£{£,) = maxjfc : 3{xi,yi), . . . , {xk,yk) S such that 

^{{{xi,yi), • ■ • , {xk,yk)}) > k and {xi - x.j){y^ - yj) > for ah i,j} . 

With this definition, 

£iU-,^)) = ^((x„,iH,y„,i(w)),...,(x„,„H,r„,„H)), (4.3) 

almost surely. 

There is a natural order on measures. If fi, v are two measures on R^, then let us say fi <v \i 
m(^) '^(^)i each Borel set A C R^. The function £ is monotone non-decreasing in the sense 
that if ^, C arc two measures in X then ^ < C ^ ^(0 — ^(C)- 

Lemma 4.2 Suppose that X and k, each have no atoms. Then for each n S N, the following holds. 

(a) There exists a pair of random point processes , defined on the same probability space, 
such that rjn < ^n, o,.s., and satisfying these conditions: ^„ has distribution Vn,\XK,pi there are 
i.i.d., Bernoulli-p random variables A'l, . . . , A'„, forp =^ cxp(— 1/3|), and i.i.d., \x k- distributed 
points iUi,Vi),...,iUK,+-+K„,VK,+-+Kj, such that iUA)=j:f=^"'^''" HiU^^V,) € A}. 

(b) There exists a pair of random point processes ^„,Cn; defined on the same probability space, 
such that ^ Cn; '^■S-, o,nd satisfying these conditions: ^„ has distribution Vn,\XK,t3', there are 
i.i.d., geometric-p random variables Ni, . . . ,Nn, forp = cxp(— and i.i.d., Xy. n- distributed 
points iUi,Vi),...,iUN,+-+N„,VN,+...+Nj, such that Cn{A) = 1{{U,,V,) e A}. 

We may combine this lemma with the weak law of large numbers and the Vershik and Kerov, 
Logan and Shepp theorem, to conclude the following: 
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Corollary 4.3 Suppose that {qn)^=i is a sequence such that lini„_i,oo n{l — Qn) — (3 R. Then, 
lim Mn,,„{7r e Sn : n-^/^£{7T) e (2e-l^l/2 _ 2el'5|/2 + e)} = 1 , 

for each e > 0. 

Let us quickly prove this corollary, conditional on previously stated lemmas whose proofs will 
appear later. 

Proof of Corollary 14. 3t Let /?„ be defined so that exp(— /?„/(n — 1)) = qn- Let tt e S'„ be 
a random permutation, distributed according to Hn.q,^, and let {(Xn.i^Yn.i), ■ ■ ■ , (Xn.mYn.n)) be 
distributed according to fJ.n.\XK.p„- ^6 have the equality in distribution of the random variables 

as we noted in Section 2, before. Note lim„_>(X3 ^^(l — qn) — implies that lim„_j.oo Pn — 

For a fixed n, we apply Lemma 14.21 but with /? replaced by to conclude that there are random 
point processes ?7„(-, w), Cti(-, lu) £ X defined on the same probability space J7, and separately, there 
are random point processes ^„(-, oj), C„(-, w) e X, defined on the same probability space, satisfying 
the conclusions of that lemma but with /? replaced by /3„. By (|4.3I) . we know that 

= m- 

By monotonicity of £, and Lemma 14.21 we know that for each k 

P{£(?7) > k} < P{£{0 > k} and P{^(0 > fc} < P{^(C) > k} . (4.4) 

Using equations (|2.2p and (|4.3p . this implies that for each e > 

tin,qA^ e Sn ■■ n-^/H{TT) < le^^^'^ + e} > P{n-i/2£(C) < le^^^'^ + e} , 
e ^„ : n-i/2^(7r) > 2e-l'3|/2 _ e} > P{n-i/2^(^) > 2e-l^l/2 - e} . 

Since the (C/^, Fi)'s end at i = /\i + • • • + A'„ or i = A^i + • • • + A^„ in the two cases, let us also 
define new i.i.d., A x K-distributed points (C/i, Vi) for all greater values of i. We assume these are 
independent of everything else. Then all ([7^, Vi) are i.i.d.. Ax k distributed. So, for any non-random 
number m G N, the induced permutation tt™ S Sm-, corresponding to ((C/i, Vi), . . . , (C/m, Vm)) is 
uniformly distributed. 

The random integers K\ , . . . , Kn and N\ , . . . , -/V„ from Lemma 14.21 are not independent of 
([/i, X^i), (C/2, V2), .... But, for instance, for any deterministic number m, conditioning on the event 
{oj e O : K\(uj) + • • • + Kniyi) < to}, we have that 

m < £{{UuV,),...,{U„,,V„,)), 

by using monotonicity of £ again. Therefore, for each n G N, and for any non-random number 
e N, we may bound 

P{n-i/2£(C) < 2el'3|/2 + e} > fij^j^ Jn G 5^,+ : n-^/^^W < 2el'3|/2 + e} 

- P({w e n : i^i(w) + • • • + /^„(w) > M+}) . 
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Similarly, for any non-random number Af„ , we may bound 

P{n-i/2^(C) > 2e-l^l/2 -e} > ^i,,-,^^ G S,,- : n^i/^^M > 2e-l'3|/2 - e} 

- P({w e 17 : 7Vi(cj) + • • • + iV„(w) < A/-}) . 

We choose S such that < (5 < e, and then we take sequences M+ = [n(e^l'^l + (5)J and A^,7 = 
[n(e'^'— (5)] . Since Ki, K2, ■ ■ ■ are i.i.d., Bernoulli random variables with mean e^'*^' , and Ni, N2, ■ ■ ■ 
are i.i.d., geometric random variables with mean e'*^', we may appeal to the weak law of large 
numbers to deduce 



lim F{{uj e n ; • ■+Kn{uj) > M+}) = lim P({w G n : Ni{uj)+- ■ •+A^„(w) < A/"}) = 0. 

Finally, by Proposition 11.31 we know that 
liminf ^{n € Sj^.j+ : n-^/^£{TT) < 2e^^^/^ + e} 



> liminf Aire 5^,+ : (Af +)-i/2£(7r) < 2 



and 

liminf il^r e 5*^- : ^-^/^^(Tr) > 26'^^^^^ ~ e} 



□ 



The bounds in CoroUarv 14.31 arc useful for small values of |/3|. For larger values of /3, they are 
useful when combined with the following easy lemma: 

Lemma 4.4 Suppose A and k have no atoms, and let the random point process ^ Cz X be distributed 
according to Vn.xxK.p- Suppose that R = [01,02] x [61,62] is any rectangle. Let ^ \ R denote the 
restriction of ^ to this rectangle: i.e., \ R){A) = ^{AnR). Note that this is still a random point 
process in X but one with a random total mass between and n. Then, for any m G {1, . . . ,n}, 
and any fc G {1, . . . , m}, we have 

P({€(e \R) = k}\ = m}) = M,„,j7r G 5™ : £{n) = k} , (4.5) 

for q = cxp(-^/ (m - 1)) . 

In order to use this lemma, we introduce an idea we call "paths of boxes." 



5 Paths of boxes 

We now introduce a method to derive Deuschel and Zeitouni's Theorem 13. II for our point process. 
For each n we decompose the unit square [0, 1]^ into sub-boxes 



i — 1 i 




p-1 .?] 




X 






n ' n 




n ' n 
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We consider a basic path to be a sequence (ii, ji ),..., (i2n-i,j2n-i) such that = (1,1), 

(j2n-i,j2r,-i) = in,n) and (ifc+i - ikjk+i - jk) equals (1,0) or (0,1) for each fc = 1, . . . , 2n - 2. 
In this case the basic path of boxes is the union IJ^"]^^ Rn{ik, jk)- Note that 

{ik+i- ik,jk+i- jk) = (1,0) ^ Rn{-ik,jk) <^ Rn{ik+i,jk+i) = {ik/n} X [{jk - l)/n,jk/n] , 
(ik+i - ik,jk+i ~ jk) = (0,1) ^ Rniik,jk) Rn{ik+i,jk+i) = [{ik - l)/n,ik/Ti]n {jk/n} . 

Now we consider a refined notion of path. We are motivated by the fact that Deuschel and 
Zeitouni's J{u, 7) function does depend on the derivative of 7. To get reasonable error bounds we 
must allow for a choice of slope for each segment of the path. So, given m € N and n & {2,3,...}, 
we consider a set of "refined" paths n„ „ to be the set of all sequences 

r := ((il, jl),ri, (i2,j2),?'2, («3,i3),''3, ■ • • , (j2ri-2, j2r,-2), ?'2ri-2, («2n- 1 , J2ri- 1 ) ) , 

where ((*i, Ji), (12, J2), • • ■ , (*2n-i, J2n-i) is a basic path, as described in the last paragraph, and 
ri, r2, . . . , r2„_2 sltc integers in {1, . . . , m} satisfying the additional condition: if ik ~ ik+i ~ ik+2 
or if jk = jk+i ~ jk+2 then rk+i > rk, for each k = l,...,2ri, — 3. We now explain the importance 
of this condition. 

Suppose that Rn{ik-, jk)^ Rn{ik+i, jk+i) = {ik/n} x [{jk- l)ln,jk/n\. Then we decompose this 
interval into m subintervals 



In.lX'i-k]jk,jk+l]r) = i — \ X 



jk - 1 _^ r-1 jk - 1 ^ 
n mn ' n m 



Similarly, if Rn{ik,jk) H Rn{ik+i,jk+i) = [{ik ~ l)/n,ik/n] x {jfc/n}, then we define 



»fc - 1 ^ r-l ik - 1 ^ 
n m ' n mn 



In either case, the choice of is which subinterval the "path" passes through in going from 

Rn{ik,jk) to Rn{ik+i,jk+i)- We define h to be InXiik] jk, jk+i;rk) or In}m{ik,ik+i] jk]rk) de- 
pending on which case it is. We also define (x^, yk) to be the center of the interval, either 

, . f^k jk-l , r-{l/2) \ f^k~l , r-{l/2) jk 

[Xk,yk) = — , + or {xk,yk) = ' 



^ n n mn J \ n mn n 

The additional condition that we require for a refined path just guarantees that Xk+i > Xk and 
Uk+i > yk for each k. 

We also define {ak,bk) G and {ck,dk) E to be the endpoints of the interval Ik- With 
these definitions, we may state our main result for paths of boxes. 

Lemma 5.1 Suppose that T € Hn.m is a refined path. Also suppose that E X is a point process 
with support in [0, 1]^, such that no point lies on any line {(x,y) : x = i/n} for i E Z or any line 
{{x, y) : y = j/n} for j e Z. Then 



> ^ [xk-i,Xk] X [yk^i^yk]), 



k=l 
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where we define {xo,yo) = (0,0) and {x2n-i,y2n-i) — (1,1)- Also, 

2n-l 

^iO < ^max \ [ak-i,Ck] x [bk-udk]) , 

fe=i 

where we define (ao,6o) = (Oj 0) '"^'^ (c2n-i7 c^2n-i) = (l;!)- 

We will prove this lemma in Section 8, after we have proved the other lemmas, since it requires 
several steps. 

Another useful lemma follows: 

Lemma 5.2 Suppose that u : [0, 1]^ — > R is a probability density which is also continuous. Then, 

2N-1 / y, \ 1/2 

max J(u,T) = 2 lim lim max > / / u(x,y) dx dy 

TeB/,([0,ll^) AT-^oo m^oo ren„,,„ ^ \J:,,_Jy,_, J 

We will prove this simple lemma in Section 7. With these preliminaries done, we may now 
complete the proof of the theorem. 

6 Completion of the Proof 

Suppose that /3 G R is fixed. At first we will consider a fixed sequence g„ = exp(— /3/(n — 1)), 
which does satisfy n(l — q„) — > /3 as n — > oo. Define the triangular array of random vectors 
in R^: ((Xn,fc, i^Ti,fe) : n G N, 1 < fc < n), where for each n G N, the random variables 
{Xn,i, Yn.i), • ■ • I {Xn^n, Yn,n) are distributed according to the Boltzmann-Gibbs measure /in,Axft,/3- 
Let ^„ G A' be the random point process such that 

n 



e„(A) = ^i{(x„,fe,r„,fc) G A}, 



k=l 

for each Borel measurable set A C R^. As we have noted before, we then have 

Mn,9„{7r G Sn : eiir) = fc} - P{£((X„,i, y„,i), . . . , (x„,„,r„,„)) = fc} 

= pwe») = fc}, 

for each fc. 

Now suppose that m,N gN are fixed. We consider "refined" paths in Iljv^rn. By Lemma [5. li 
which applies by first rescaling the unit square [0, 1]^ to [0,i(/3)]^, 

2JV-1 

i{U > max V £(e„ r [L{(3)xk-uL{l3)xk] x [i(/3)yfc„i, i(/3)yfc]) . (6.1) 
ren„,,„ 

The only difference is that we use the square [0, L{/3)]'^ in place of [0, 1]^. Also, 

2N-1 

£(e„) < max V £(e„ r [£(/3)afc_i,L(/3)cfc] X [i(/3)6fc_i,L(/3)dfc]). (6.2) 
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Now suppose that F g liN.m is fixed. Also consider a fixed sub- rectangle of F, 

Rk = [LiP)xk-uL{l3)xk] X [L(/3)j/fc_i,L(/3)yfc]. 

By Lemma |2.2[ we know that the random variables S,n{Rk)/n converge in probability to the non- 
random limit <Tp{Rk), as ri — > oo. Moreover, conditioning on the total number of points in the 
sub- rectangle ^n{Rk), Lemma tells us that 

PmCn \ Rk) = •} I iURk) = r}) = M.,9„{^ e Sr : £{7t) = .} . 

Note that the sequence of random variables ^„(_Ra:)(1 — qn) converges in probability to j3<7p{Rk) as 
n — >■ oo, because 

URk)il-qn) = n{l~q„)^^^, 

n 

and n(l — g„) — /3 as ri — )■ cxj. Therefore, using Corollarv 14.31 this implies for each e > 
lim P|^„(i?fe)"^/^^(e„ \ R)e (2e^'''^'*(^'=)/2_g^2e'''"^(^'=)/2 + e)\ ^ i. 

n—¥oo L J 

Since we have a limit in probability for ^n{Rk) /^^ we may then conclude for each e > that 



hm P|n-V2^(e„ ri?..0e(2[a,(i^fc)]l/^e-^-^(^'=)/2-6,2[a,(i?,)]V2e/^'^M«.)/2+,)^ = ^ 

n— ^oo L / 



This is true for each sub-rectangle Rk comprising F, and F is in lif^ m. But there are only 
finitely many sub-rectangles in F, and there are only finitely many possible choices of a refined path 
of boxes F G njv,Tn7 for N and m fixed. Combining this with ()6.ip implies that for any e > we 
have 



lim P <^ n-^'^l{^n) > max V 2[a«(i?fc)]i/2e-'3"'^(^'='/2 -A = 1. (6.3) 
I fc=i J 

By exactly similar arguments and (|6.2p we may also conclude that for each e > 

n-^/^ei^n)< max V 2[(7/j(i?^)]i/2e/^-MflD/2 + g I ^ 1 ^ (6_4) 
■ k=i ) 

where we define 

Rl = [L(/3)afe_i,L(/3)cfe],[L(/3)6fc_i,i(/3)4.], 

for each fc = 1, . . . , 2iV - 1. 

We apply Lemma 15.21 to up. For N fixed, taking the limit m — > oo, the area of the symmetric 
differences of the boxes R% and Rk converges to zero, uniformly in F G Ii.N,m for each k = 1, . . . , 2N— 
1. Since cr^ has a density, the same is true replacing area by a^-measure. Moreover, exp(— /3CT^(i?fc)) 
and exp(/3(7^(i?^)) converge to 1 uniformly as — > oo. Therefore, 



2N-1 

lim lim max V 2fcr«(i?fe)li/2g-/3^^(fl.)/2 
w-i-oo m-i-oo ren- - ' 



k=l 

2N-1 



hm lim max ^ 2[a0{Rl)]' 
' fc=l 

— max J(uR,T). 

TGB^([0,L(/3)]2) 



/2g-/i<T3(i?,t.)/2 (6-5) 
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Combined with (|6.3p and (|6.4p . this imphes that for each e > 0, 



hm P 



<e = 1 



J"'^'^(Cn)- max Jiui3,T) 
Tee^([o,L(/3)]2 

Finahy, we use Lemma [3.31 to conclude that 

max J{uB,T) < C{/3) . 

But taking T = {{t,t) : t G [0, which is the graph of the straight hne curve 7 € 

C>([0,L(/3)]2), gives 

i(/3) I 



This integral gives 

Thus, the proof is completed, for the special choice of ((?n) equal to (exp(— /?/ {n — 1))). Because 
the answer is continuous in /3, if we consider any sequence (g^) satisfying n(l — (/„) — > /?, then we 
get the same answer. All that is left is to prove all the lemmas. 



7 Proofs of Lemmas [Ol ID] and [O 

We now prove the lemmas, in an order which is not necessarily the same as the order they were 
stated. This facilitates using arguments from one proof for the next one. 
Proof of Lemma 13.21 Define 

j;(u,T) = M{J{u,V) : r e n(T), llT'll < e} 

for each e > 0. We first show that this function is upper semi-continuous. 

Let n„ denote n„([ai, 02] x [61, 62]). We remind the reader that this is the set of all (n+l)-tuples 
V = ((a;o,yo), • ■ • , ixn,yn)) G (R^)"+^ such that ai = xo < • • • < a;„ = a„ and h ^ ya < ■ ■ ■ < 
j/„ = &2- For each "P g n„, we have 

J{u,V)^Y.[ / u{x,y)dxdy\ . 

Since u is continuous, the mapping •) : n„ R is continuous when n„ has its usual topology 
as a subset of (R2)"+i. 

Consider a fixed path T e B /^{\a\,a-i\ x [&i,62]) and a partition V £ n(T) such that WVW < 
€. Note that there is some n such that V G n„(T). Suppose that (T''"'))5^L-^ is a sequence in 
By^{[ai, X [61,62]) converging to T in the Hausdorff metric. Then for each point {x,y) € T, 
there is a sequence of points (x'*''^ y^*^') € T^*^' converging to {x,y). Therefore, we may choose a 
sequence of partitions P''"'' G n„(T('^^) converging to V in n„. By the continuity mentioned above, 

lim J{u,r''''^) J{u,r). 

Also, IjT^'-'^-'ll converges to ||7-'|| which is less than e. So, for large enough fc, we have < e, and 

hence 
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since the right hand side is the infimum. Therefore, we see that 

Urn sup J,{u,T'^''^) < J{u,V). 

Since this is true for all V E n(T) with WVW < e, taking the infimum we obtain 

limsup J,(u,TW) < Je{u,T). 

Since this is true for every T £ By^{[ai,a2] x [61, 62]) and every sequence (T^'^^) converging to T in 
the Hausdorff metric, this proves that JTe{u, ■) is upper semi-continuous on yB/<([ai, 02] x [61, 62])- D 

Proof of Lemma 15. 2t The proof of this lemma is also used in the proof of Lemma 15751 This 
is the reason it appears first. 

Recall the definition of the basic boxes for i,j G {!,..., N}, 



RN{i,j) 



1 i 



N ' N 



J - 1 J_ 

N ' N 



Given N £ N, let us define utr and u,^ so that 



N 

N "' " 



(.x,y) = , , max,. . • li?iv(^J)(a;,^/) , 



. . , (a:',i/')G/?jv(ij) 



TV 

' * if' 11' \ (— N 1 1 t1 



By monotonicity, J{ujj, T) < J{u, T) < J{u'^, T) for every T G i5y ([0, 1]^). But since uj^ and 
are constant on squares, we know that the optimal T's for and arc graphs of rcctifiablc curves 
7 that are piecewise straight line curves on squares. This follows from the discussion immediately 
following the statement of Lemma [5751 where we verified the special case of that lemma for constant 
densities. The only degrees of freedom for such curves are the slopes of each straight line, i.e., where 
they intersect the boundaries of each basic square. 

For (.Tfc, j/fe), (a;fc+i, G i?Tv(*, j) representing two points on the boundary, such that Xk-i < 

Xk and yk-i < 2/fc, considering 7^ to be the straight line joining these points. 



Ju+{x{t),y{t))x'{t)y'{t)dt = y/ [xk - Xk-i){yk - yk-i) max y^u{x,y) , 

with a similar formula for u~ . This is a continuous function of the endpoints. We may approximate 
the actual optimal piecewise straight line path by the "refined paths" of boxes in nTv,m if we take 
the limit m — ^ cx) with N fixed. Therefore, we find that 



2TV-1 / y, 

max J'(utr,T) ~ lim max / / utr(x,y) dx dy 



1/2 
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Note that by upper semicontiiiuity, for each fixed N, the hmit as m — >■ cx) of the sequence 

max j / / u{x, y) dx dy ] 



also exists, and is the suprcmum over to e N. Therefore, we conclude that for each fixed £ N, 



2iV-l / \ 1/2 



nax I / / u{x,y)dxdy 



lim max 

fe=l \'' -i^fc-l !/fc-l 



max J(u,^,T), max J'(w1;,T) 
TeS/,([o,i]^) Teey([o,i]^) 

But taking N oo, we see that and ujj converge to u, uniformly due to the continuity of u. 
Therefore, by the bound from equation p.4p . the lemma follows. □ 



Proof of Lemma 13. 3t Suppose that x{t), y{t) is a parametrization of a curve 7 € 
C^([0, L(/3)]2). We may consider another time parametrization xi{t) = x{f{t)) and = y{f{t)) 
for a function /(t) such that 

xiit)yi{t) = 

Indeed, we obtain x{f{t))y{f{t)) = t^. Setting g{t) = x{t)y{t), our assumptions on x{t) and y(t) 
guarantee that g is continuous and g'{t) is strictly positive and finite for all t. We then take 

Since a change of time parametrization does not affect J'{uf},^)^ we will simply assume that 
x{t)y{t) ~ is satisfied at the outset. Then we obtain 



2 ^x'it)y' {t) 
1 -/3<2 



due to the formula for u^, and the fact that x{t)y{t) =t^= L'^(J3) at the endpoint of 7. Now since 
we have x{t)y{t) = t'^, that implies that 

x{t)y'{t)+y{t)x'{t) = 2t. (7.1) 

We know that x'(t) and y'{t) are nonnegative. Therefore, we may use Cauchy's inequality with e 

y^ww = wm'^'wit)]'^' < ix'{t) + i-y'it), 

for each e S (0, 00). Taking e = y{t)/t we get e^i = t/y{t) which is x{t)/t since we chose the 
parametrization that x(t)y{t) = t'^. Therefore, we obtain 

y{t)x'{t)+x{t)y'{t) 
2t 

Taking into account our constraint (|7.ip . this gives 
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Since this is true at all t e [0, L{f3)] this proves the desired inequality. But this upper bound gives 
the integral 

(7-2) 



which equals the formula for from (|1.4p . 

The argument works even if 7 is only piecewise C^, with finitely many pieces. Moreover, by the 
proof of Lemma 15. 2[ we know that the maximum over all T is arbitrarily well approximated by 
optimizing over piecewise linear paths. So the inequality is true in general. □ 

Proof of Lemma 14. 4t This lemma is related to an important independence property of the 
Mallows measure. Gnedin and Olshanski prove this in Proposition 3.2 of [TU], and they note that 
Mallows also stated a version in [TB]. Our lemma is slightly different so we prove it here for 
completeness. 

Using Definition 14.11 we can instead consider (Xi, Yi), . . . , y„) distributed according to 
Mn,AxK,^ in place of ^ distributed according to i'n,xxK,i3- Given to < n, we note that, conditioning on 
the positions of {Xm+i,Ym+i), ■ ■ ■ , {Xn, Yn), the conditional distribution of (Xi, Yi), . . . , {Xm, 
is the same as iJLm,a,i3' where /3' — {m ~ 1)/?/ {n — 1) and where a is the random measure 



daix,y) = cxpi ^ V h{ 



X - Xi,y- Yi)j dX{x) dK{y) , 

where Zi is a random normalization constant. By finite exchangeability of fJ-n,\XK,p it does not 
matter which m points we assume are in the square [ai, 02] x [61, 62] which is why we just chose the 
first TO. 

If wc could rewrite a as a product of two measures A', k' without atoms then we could appeal 
to (|2.2p . By inspection a is not a product of two measures. However, if we condition on the event 
that there are exactly m points in the square [ai, 02] x 62] then we can accomplish this goal. Let 
use define the event A = {(X,„+i, y„i+i), . . . , (X„, F„) ^ [ai, 02] x [61, 62]}- Then, given the event 
A, we can write 

'^[ai,a2]x[bub2]i^'y)dai^,y) = dX' (x) dn' (y) , (7.3) 
where A' and k' arc random measures 

dX'ix) = J-e-''''i(^)/("-i)dA(a;), dn'iy) = — e-^'''^y^^^''-^UKiy) , 
Z2 Z3 

with Z2 and Z3 normalization constants and random functions 

n 

hi{x) = ^ [l{y,<bjl(x,,oo)(a;) + l{Y;>b2}l(-oo,Xi)(a;)] , 

i—ra-\-l 

and 

n 

h2{y) ^ 5Z [l{X,<ai}lCK,,oo)(y) + l{X,>a2}l(-oo,y,)(^)] ■ 

'i— m+1 

This may appear not to reproduce a exactly because it may seem that hi and h2 double-count 
some terms which are only counted once in Yll^=m+i ^(^ ~ V ~ But this is compensated by 
the normalization constants Zi and Z2 as we now explain. 
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Note that for each i E {m + 1, . . . ,n} since {Xi,Yi) ^ [ai, 02] x [61, 62] we cither have Yi < bi, 
Yi > 1)2, Xi < ai or Xi > a2. These are not exclusive. But for instance, if Yi < bi and Xi < ai then 
for every {x,y) € [01,02] x [&i,&2], we have l{Yi<bi}'i-{Xi,oa){x) = 1 and l{Xi<ai}l(Yi,oo)(y) = 1- 
Therefore, these terms are constant in the functions hi{x) and /i2(y): they do not depend on the 
actual position of {x,y) as long as (x, y) € [01,02] x [61,62]- Therefore, using the normalization 
constants Zi and Z2, this double-counting may be compensated. 

Since we are conditioning on {(Xi, Yi), . . . , (Xm, Ym) G [oi, 02] x [61, 62]} and the event A, the 
conditional identity (|7.3p suffices to prove the claim. □ 



8 Proof of Lemma 14.21 

This is the most involved lemma to prove. It follows from a coupling argument. In fact we use the 
most basic type of coupling for discrete random variables, based on the total variation distance. 
See the monograph |14j (Chapter 4) for a nice and elementary exposition. But we also combine this 
with the fact that we have a measure which may be derived from a statistical mechanical model 
of mean field type. Because the model is of mean field type, the correlations are weak and spread 
out. In principle, this allows one to approximate by a mixture of i.i.d., points as one sees in de 
Finetti's theorem in probability, or the Kac-Lebowitz-Penrose limit in statistical physics. (See [1] 
for a reference on the former, and the appendix of [20] for the latter.) 

Given a probability measure a on R^, let 9i^a be the distribution on X associated to the random 
point process 

a(A^) = lAiX{Lj),Y{u;)), 
assuming {X{uj),Y{uj)) is a-distributed. 

Lemma 8.1 Suppose that a and a are two measures on such that a ^ a, and suppose that for 
some p e (0, 1] there are uniform bounds 

da n 
da 

Then the following holds. 

(a) There exists a pair of random point processes 771,^1, defined on the same probability space, 
such that r]i < ^i, a.s., and satisfying these properties: ^1 has distribution 0i^a; there is an 
a-distributed random point (Ui,Vi), and independently there is a Bernoulli-p random variable 
Ki, such that r]i{A) = KilAiUi, Vi). 

(b) There exists a pair of random point processes ^i,Ci; defined on the same probability space, 
such that ^1 < Ci, a.s., and satisfying these properties: ^1 has distribution ^1.5; there is a 
sequence of i.i.d., a-distributed points (Ui, Vi), {U2, V2), . . . and a geometric-p random variable 
N,, such that CM) = UiU,,Vi,). 

Proof: Let / = da/da. We follow the standard approach, for example in Section 4.2 of [13]. 
We describe it here in detail, in order to be self-contained. Define g{x) = (1 — p)^^[f{x) —p], which 
is nonnegative by assumption, and let a be the probability measure such that da /da = g. Note 
that a can be written as a mixture: a = pa + (1 — p)a. 
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Independently of one another, let {Ui,Vi) G be a-distributed, and let {Wi,Zi) £ be 
d-distributed. Independently of all that, also let Ki be Bernoulli-g. Then, taking 



{Ui,Vi) ifA'i = l, 
{Wi,Zi) otherwise , 



we see that {Xi,Yi) is a-distributed. We let ryi(A,w) = A'i(w)lA(f/i(a;), Vi(a;)). If Ki{uj) = 1 
then {Ui{uj),Vi{lj)) = {Xi{lu),Yi{uj)). Therefore taking ^i{A,n) = 1a{Xi{uj),Yi{uj)), we see that 
Vii'j^) ^ CiC'i"^): a-S- This proves (a). 

The proof for (b) is analogous. Let h{x) — {p^^ — l)^^[p^^ — f{x)], which is nonnegative 
by hypothesis. Let a be the probability measure such that da /da = h. Then a can be written 
as the mixture: a = pa + (1 — p)a. Independently of each other, let [Xi,Yi), (X2,l2), ■ • ■ be 
i.i.d., a distributed random variables, and let {Zi,Wi), (Z2, M^2), ... be i.i.d., a distributed random 
variables. Also, independently of all that, let Ki, ■ ■ ■ be i.i.d., BcrnouUi-q random variables. 
For each i, we define 



iX,,Y,) if/i, = 1, 
{Zi,Wi) ot her wise . 



Then (Ui, Vi), {U2, V2), . . . are i.i.d., a-distributed random variables. Let A^i = min{n : A'„ = 1}. 
We see that (Xn^Yn,) = (Un^Vn,). So clearly 1a{Xn„Yn,) < lA{Uk,Vk). □ 



Note that Ki and A^i are random variables which are dependent on {Ui, Vi), {U2, V2), .... But, 
for instance, conditioning on the event {A^i > i}, we do see that {Ui,Vi) is a-distributed. This is 
for the usual reason, as in Doob's optional stopping theorem: the event {iVi > i} is measurable 
with respect to the cr-algebra generated by Ki, . . . , Ki-i, while the point {Ui, Vi) is independent of 
that (7- algebra. This will be useful when we consider n > I, which is next. 



8.1 Resampling and Coupling for n > 1 

In order to complete the proof of Lemma 14.21 we want to use Lemma 18.11 More precisely we wish 
to iterate the bound for n > 1. Suppose that d„ is a probability measure on (R^)", and a is a 
probability measure on R^. Let 0„,a„ be the distribution on X associated to the random point 
process 

n 

UA^) = J2lA{Xk{io),Yk{u;)), 

k=l 

assuming {Xi{uj),Yi{u!)), . . . , (X„(a;), are d„-distributcd. 

If a„ was a product measure then it would be trivial to generalize Lemma 18.11 to compare it to 
the product measure a". But there is another condition which makes it equally easy to generalize. 
Let T denote the Borel cr-algebra on R^. Let J-"" denote the Borel cr-algebra on (R^)". Let 
J^J! denote the sub-cr-algebra of J"" generated by the maps . . . , (a;„,y„)) i-> {xj,yj) for 

j e {1, . . . ,n} \ {k}. We suppose that there are regular conditional probability measures for each 
of these sub-cr-algebras. Let us make this precise: 

Definition 8.2 We say that a„.fe : J- x (R^)" — > R is a regular conditional probability measure for 
an, relative to the sub-a-algebra J-J} if the following three conditions are met: 



19 



1. For each ((cci, yi), . . . , (a;„, y„)) g (R^)" the mapping 

A a„,fe(A; (.Ti,2/i), . . . , (a;„,y„)) 
defines a probability measure on T . 

2. For each A G J- , the mapping 

y^), . . . , (cc", y")) ^ a„,fc(A; (xi, yi), . . . , (a;„, y„)) 

is J-"" measurable. 

3. The measure an,k is a version of the conditional expectation E"" [• 1^^]- In this case this 
means precisely that for each Ai, . . . , An G 



an.k 



Ak; (Xi, Yi), . . . , (X„, y„)) n 1a^ {X,Xj 



ctniAi X • • • X A„) 



For p G (0, 1], we will say that q;„ satisfies the p-resampling condition relative to a if the following 
conditions are satisfied: 

• There exist regular conditional probability distributions otn^k relative to FJ^ for k = 1, . . . ,n. 

• For each {{xi,yi), (xn,yn)) G R-", and for each k ~ 1, . . . ,n, 

an,k{-;{xi,yi), . ■ . ,{xn,yn)) < a- 

• The following uniform bounds are satisfied for each ((xi, yi), . . . , (a;„, y„)) € R", and for each 
fc = 1, . . . , n: 

dQ;„,fe(-;(a;i,yi),...,(a;„,y„)) _i 

P < ^ < P ■ 

da 

Lemma 8.3 Suppose that for some p G (0, 1], the measure an satisfies the p-resampling condition 
relative to a. Then the following holds. 

(a) There exists a pair of random point processes I'jmS.n, defined on the same probability space, 
such that Tjn < ^ri, a.s., and satisfying these properties: ^„ has distribution 0n,a„! there 
are i.i.d., a -distributed points {{U^ ,Vi)}]^^^, and independently there are i.i.d., Bernoulli-p 
random variables Ki, . . . , Kn, such that rin{A) = J2^=i Kk^AiUi, Vi). 

(b) There exists a pair of random point processes ^„,Cn; defined on the same probability space, 
such that ^„ < C,n, a.s., and satisfying these properties: ^„ has distribution On.di„! there are 
i.i.d., a- distributed points {{U^,V^) : k ~ l,...,n, i ~ 1,2,...}, and i.i.d., geometric-p 
random variables Ni, . . . , Nn, such that Cn(^) = J^k^i '^A{Ui,V^'^). 

Proof: We start with an a„-distributed random point {{X^,Y^), . . . , (X^, )). Then itera- 
tively, for each k = 1, . . . ,n, we update this point as follows. Conditional on 



20 



we choose {X^,Yi^) randomly, according to the distribution 

We let r/) = r/"^) for each j e {1, . . . , n} \ {k}. With this resampling rule, we can 

see that {{X^ , (X^,y„'')) is a„-distributed for each k. Also {XJ},Y^) = {X^,Y^). 

We apply Lemma [8. II to each of the points {X'',Y''), in turn. Since they all have distributions 
satisfying the hypotheses of the lemma, this may be done. Note that by our choices, the various 
{Ul',V^''ys and K^s and iV^'s have distributions which are prescribed just in terms of p and a. 
Their distributions do not depend on the regular conditional probability distributions, as long as 
the hypotheses of the present lemma are satisfied. Therefore, they are independent of one another. 
□ 



Given the lemma, for part (a) we let {Ui,Vi), . . . , (Uki-^ h-R'„i^i<'iH hK„) be equal to the 

points {Ui,Vi) such that Kk = 1, suitably relabeled, but keeping the relative order. By the idea, 
related to Doob's stopping theorem, that we mentioned before, one can see that 

{Ui,Vi), . . . , {Uki+---+k„,Vki+---+k„) 

are i.i.d., a-distributed. We do similarly in case (b). This allows us to match up our notation with 
Lemma [4?2l The only thing left is to check that "p-resampling condition" for the regular conditional 
probability distributions is satisfied for Boltzmann-Gibbs distributions. 



8.2 Regular conditional probability distributions for the Boltzmann-Gibbs 
measure 

In Lemma [42l we assume that {{Xi, Yi), . . . , (X„, y„)) arc distributed according to the Boltzmann- 
Gibbs measure ^n,AxK,/3- Then we let h'n,\XK,p be the distribution of the random point process 
such that 



UA) = Y.lA{Xk,Yk) 



k=l 

In other words, the distribution fin.xxK.p corresponds to the distribution we have denoted On^a^ if 
we let a„ = axk /3- We take a ~ X x k. Now we want to verify the hypotheses of Lemma 18.31 for 
p = e-\P\. 

Referring back to Section [21 we sec that q;„ is absolutely continuous with respect to the product 
measure a". Moreover, 

^((a;i,yi),...,(a;„,y„)) = -^-^-^ cxp (3Hniixi,yi), . . . , {xn,yn))j ■ 

Here the Hamiltonian is 

^ n — l n 

Hniixi,yi),...,{xn,yn)) = ^ ^ H^i - Xj , yi ~ yj) . 

1=1 j=i+l 
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This leads us to define a conditional Hamiltonian for the single point (x, y) substituted in for [xk, yk) 
in the configuration . . . , (a;„,j/„)): 

1 " 

Hn,k{[x,y);{xi,yi),...,{xn,yn)) = ■:^—^'^K[x - Xj,y - yj) . 

With this, we define a measure a„.fe (•; (xi, yi), . . . , (a;„, j/n)), which is absolutely continuous with 
respect to a, and such that 

da„^fc(-; (a^l, yi), ■ ■ ■ , [Xn, yn)) ^ 1 g-/3ff„,j,((x,y);(a;i,yi),...,(x„,y„)) 

da '• Z„_fc (a, /3; (xi,yi ),..., (a;„,2/„)) 

The normalization is 

To sec that this is the desired regular conditional probability distribution, note that in the product 

dan,k{-\ ixi,yi), . . . , (x„,y„)) ^ ,dan , , 

1,2;, yj ^^^^ U'^^ii yijj • ■ • J l-^nj 

we have the product of two factors: 

1 



g-/3-fi"n,fc((a:,j/);(xi,yi),...,(2;„,y„)) 



^n,fc(a,/3; {xi.yi), . . . , (a;„,y„)) 
and ^ 

— — CXp( - l3Hn{{xi,yi), . . . ,{Xn,yn))] ■ 

The first factor does not depend on {xk, yk)- The second factor does depend on it, but integrating 
against da{xk,yk) gives, 

-/3ff„((xi,i;i),...,(a:„,y„)) Zn k (a, 0] (xi , yi) , . . . , (x^ , Vn) r, y , ^^ 

R2 Zn{a,(3) ' Z„(a,/3) 



where 

h: 

and we have 



^ n — 1 n 

^,fe((2:i,2/i),--->(2;«,2/«)) = ^^rryXl XI Kxi-Xj,yi-yj)., 



n 



H'nAi^i^yi)' • ■ • ' yn)) + Hn^k{{x, y); (xi, yi), . . . , (x„, y„)) 

= Hn{{xi,yi), {xk-i,yk-i), {x, y), (a;fc+i, y^+i), . . . , (a;„, y„)) 

Therefore, 

f dan,k{-,{xi,yi),...,{xn,yn)) , .dan,, s , , , . 

J ^ ^ {x,y) -j—;;;{{xi,yi), . . . , {Xn,yn)) da{xk,yk) 
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equals 

■^{{xi,yi), {xk-i,yk-i), {x,y), {xk+i,yk+i), {xn,yn)) ■ 

This imphes condition 3 in Definition [521 Conditions 1 and 2 are true because of the joint measur- 
abihty of the density, which just depends on the Hamihonian. 
Note that for any pair of points {x, y), (x', y'), we have 

\Hn,k{{x,y);{xi,yi),...,{xn,yn)) - Hn,k{{x',y');{xi,yi),...,{xn,yn))\ < 1, (8.1) 

because \h{x — Xj,y — yj) — h{x' — Xj, y' — yj)\ is either or 1 for each j, and Hn^k is a sum of n — 1 
such terms, then divided by n — 1. Wc may write 

l^ da^,k{-Ax,,y,l...,{xn,yn)) ^^^ y)^ = Z„.,(a, /3; {x^,y,), . . . , (a:„, y„))e^^"-«-^)^(-'^^)--(-'^"» 
as an integral 

^p[Hn,k{{x,y)\ixi,yi},...,{x„,y„))-H„,k{{x' ,y')-{xi,yi),...,{x„.,y„))] ^'-j 

Therefore, the inequality ()8.1|) implies that 



-\B\^fd^nM{-;{xi,yi),...,{Xn,yn)), A ^ |«| 

1 da Hx,y) I < e"^! . 

Of course, this implies the same bounds for the reciprocal. For all {x, y) G R^, 

-1/31 / dan^k{-;{xuyi), ■ . ■ ,{Xn,yn)) , . ^ l«| 

da 

So, taking p = e~l^l, this means that the hypotheses of Lemma 18.31 are satisfied: q;„ has the 
"p-resampling" property relative to the measure a. Hence, we conclude that Lemma 14.21 is true. 
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